Beyond the Zipf-Mandelbrot law in quantitative linguistics

نویسنده

  • Marcelo A. Montemurro
چکیده

In this paper the Zipf-Mandelbrot law is revisited in the context of linguistics. Despite its widespread popularity the Zipf–Mandelbrot law can only describe the statistical behaviour of a rather restricted fraction of the total number of words contained in some given corpus. In particular, we focus our attention on the important deviations that become statistically relevant as larger corpora are considered and that ultimately could be understood as salient features of the underlying complex process of language generation. Finally, it is shown that all the different observed regimes can be accurately encompassed within a single mathematical framework recently introduced by C. Tsallis.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Zipf–Mandelbrot law, f-divergences and the Jensen-type interpolating inequalities

Motivated by the method of interpolating inequalities that makes use of the improved Jensen-type inequalities, in this paper we integrate this approach with the well known Zipf-Mandelbrot law applied to various types of f-divergences and distances, such are Kullback-Leibler divergence, Hellinger distance, Bhattacharyya distance (via coefficient), [Formula: see text]-divergence, total variation ...

متن کامل

A Simple LNRE Model for Random Character Sequences

This paper describes a population model for word frequency distributions based on the Zipf-Mandelbrot law, corresponding to the word frequency distribution induced by a random character sequence. The model, which has convenient analytical and numerical properties, is shown to be adequate for the description of language data extracted by automatic means from large text corpora. It can thus be us...

متن کامل

Is space a word, too?

For words, rank-frequency distributions have long been heralded for adherence to a potentiallyuniversal phenomenon known as Zipf’s law. The hypothetical form of this empirical phenomenon was refined by Ben̂ıot Mandelbrot to that which is presently referred to as the Zipf-Mandelbrot law. Parallel to this, Herbert Simon proposed a selection model potentially explaining Zipf’s law. However, a signi...

متن کامل

On a General Theorem of Number Theory Leading to the Gibbs, Bose–Einstein, and Pareto Distributions as well as to the Zipf–Mandelbrot Law for the Stock Market

The notion of density of a finite set is introduced. We prove a general theorem of set theory which refines the Gibbs, Bose–Einstein, and Pareto distributions as well as the Zipf law.

متن کامل

Minimum cost and the emergence of the Zipf-Mandelbrot law

This paper illustrates how the Zipf-Mandelbrot law can emerge in language as a result of minimising the cost of categorising sensory images. The categorisation is based on the discrimination game in which sensory stimuli are categorised at different hierarchical layers of increasing density. The discrimination game is embedded in a variant of the language game model, called the selfish game, wh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره cond-mat/0104066  شماره 

صفحات  -

تاریخ انتشار 2001